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Process for producing antbracyclines and intermediates thereof 

The present invention pertains to a process for producing anthracyclines and interme- 
diates thereof by expressing in a foreign host a DNA fragment relating to the 
5 biosynthctic pathway of anthracyclines and, if desired, the intermediates obtained are 
converted to anthracyclines or aglycones thereof using non-producing mutant strains. 

Polyketidc antibiotics are a broad and variable group of compounds which are 
composed of poly-p-ketomethyiene chain [CHRO]^2o- A common feature of poly- 
10 ketides is their biosynthetic rouie which is similar to the biosynthesis of fatty acids. 
Katz, L. and Donadio, S. (1993) have recently published a review article concerning 
polykctidcs. As their structure the antibiotics of anthracycline group are aromatic 
poiyketidcs, the common structural body of which is 7,8,9,10-tetrahydio-5,12- 
naphthaccne kinone of the general formula (A) 




R4 O Rg f\j 



To this structural body one or more sugars and other substituents are attached. The 
25 structural body of the molecule, to which the sugars are attached, is called an 
aglyconc. Anthracyclines arc discussed more specifically e.g. in the article of A. 
Fujiwara and T. Hoshino (1986). Several anthracyclines are cytostatically active and 
thus they are of continuous interest. 

30 To find new anthracyclines screening of Streptomyces bacteria from the soil and 
mutation thereof arc used. To modify known anthracyclines synthetic methods have 
been used, whereby chemical groups are added to or removed from either the aglycone 
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or the sugar moiety. Similarly, biotransformation is used, wherein in living cells 
molecules are modified which have been produced by other production strains or by 
synthetic methods. Some anthracyclines have also been produced by synthetic 
methods. 

5 

The hybrid antibiotic technology has been disclosed as a new technology in the 
preparation of new antibiotics. It has been established to comprise production by 
genetic engineering of molecules which have structural features of natural products of 
two strains. The process is described in the publication of H-G. Ross: "Hybrid 
10 antibiotics - the contribution of the new gene combinations" (1987). The hybrid 
antibiotic technology gives an opportunity to controlled production of new compounds. 

Cloning of actinorhodin genes from Streptomyces coelicolor (Hopwood et aL, 1985) 
can be considered as the pioneer work in the molecular biological study of polyketidc 

15 antibiotics and at the same time of strcptomycetcs. In 1987 Malpartida et al reported 
about the hybridization of different polyketide producers to the actl and acrtll DNA 
fragments and thereafter genes of the polyketide synthase (PKS) domain have been 
identified in many Streptomyces ^ecies exploiting the homology. Sequencing of these 
genes has shown that the genes are strongly conserved and include three Open 

20 Reading Frames, ORF 1, 2 and 3. The products of these three genes arc needed for the 
formation of the linear polyketide bound to the enzyme complex. For the optimal 
formation of the correct product encoded by the PKS-gcncs five ORFs arc needed in 
tctraccnomycin (Shcn and Hutchinson, 1993). The sequenced aromatic PKSs arc given 
in Table 1. 

25 
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Table 1. Cioned and sequenced gene domains encoding polyketide synthase of 
aromatic polyketide antibiotics 



Strain 


Product 


Reference 


5. coelicolor 


aktinorhodin 


Fernandez-Moreno, M.A. et al. 
1992 

Hailam, S.E et aL 1988 


5. violaceoruber 


granaticine 


Sherman, D.H. et aL 1989 


S, glaucescens 


tetracenomycin 


Bibb, MJ. et aL 1989 


S. rimosus 


oxitctracycUne 


Kim, E-S. et aL 1994 


S. cinnamonensis 


monensine 


Arrowsmith, TJ. et aL 1992 


S. griseus 


griseusine 


Yu, T-W. et aL 1994 


S. roseofulvus 


frenolisine 


Bibb, M J. et aL 1994 



Polyketide synthase (PKS) is a multienzyme complex which functionally reminds the 
15 synthase of long chain fatty acids. The separate components of actinorhodin PKS are 
so called actORFl ketoacyi synthase (KS); actORF2 homologous to KS may effect on 
the length of the polyketide chain (McDaniel, R., et aL, 1993); actORF3 acyl carrier 
protein (ACP); actORFS ketoreductase (KR) and actORF4 cyclase/dehydrase, which 
may be responsible for the aromatization of the first ring. 

20 

The most part of the biosynthetic anthracyclines are formed via the aklavinone 
intermediate phase, whereafter the compound is glycosylated or it is modified by 
adding e.g. hydroxyl or methyl groups. Modifications can occur also after the 
giycosylation. The biosynthesis of aklavinone and anthracyclines which are further 
25 formed therefrom are described e.g. in "Advances in bioconversion of anthracycline 
antibiotics" (1989) of U. Grafc et aL, and in the references cited therein. The 
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biosynthctic route of ihe nogalamycin aglycone being formed of ten acetates is 
evidently analogous to the biosynthesis of aklavinone. (Figures lA and IB). 

DescriptioD of the invention 

5 

A DNA fragment cloned from Streptomyces nogalater can be used according to this 
invention to combine the different phases of the biosynthetic route of anthracyclines, 
whereby hybrid anthracyclines and precursors of anthracyclines can be produced. This 
happens by transferring the cloned DNA fragment to a Streptomyces strain which 
10 produces anthracyclines or, alternatively, to a non-producer of anthracyclines. 

The DNA fragment of Streptomyces nogalater including in the biosynthesis of 
anthracycline and being cloned according to this invention caused surprismgly 
production of anthracycline precursors in 5. lividans, a host which does not produce 
IS anthracyclines. On the basis of the structures of the compounds obtained, the DNA 
fragment was supposed to include most of the genes needed for the biosynthesis of 
anthracycline aglycones. By complementation of mutant strains, analyzing the hybrid 
products and sequencing the DNA fragments we have been able to show that the DNA 
fragment comprises 

20 - the activity responsible for the election of the starting unit which defines the side 
chain of the 9-position (S. galilaeus hybrid products), 
the polyketidc synthase genes, 

the gene of the enzyme wliich is needed for removing the hydroxyl in 2- 
position, (ketoreductase), 
25 - the methyl transferase gene needed for the carboxylic acid esterification, 
the mono-oxygenasc gene. 

This DNA fragment and anthracycline precursors produced by it have further been 
used to produce hybrid anthracyclines. 

30 

The present invention enables one to produce some known cytostatically active 
anthracyclines (auramycins) as well as prior unknown compounds. Use of the 
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polykctidc synthase of anthracyclincs in the production of hybrid anthracyclines has 
not been described previously, neither the change of the starting unit of polyketide 
synthesis by transferring genes to a foreign host. Further, there is no prior disclosure 
of the cloning of genes of the biosynthetic pathway of nogalamycin produced by 5. 
5 nogalater, or use thereof. 



The similarity of the biosynthetic genes of polykctidc antibiotics disclosed by 
Malpartida et ai (1987) was the starting point to the discovery of the biosynthetic 
genes of nogalamycin. The total DNA of S, nogalater being cleaved by suitable 
10 restriction enzymes was hybridized by the Southern-techniques to the actl probe, and 
thus two hybridizing DNA fragments were obtamed. In an optimal case a suitable 
probe shows one DNA fragment. The use of cross hybridization was, however, 
considered to be possible as a strategy in identifying the biosynthetic genes, because 
the signals were strong, 

15 

The strategy by which the DNA fragment according to the invention was found was 
the following: A fragment homologous to the actl fragment described by Malpartida 
et ai (1987) was isolated from i nogalater. Said homologous fragment and flanking 
DNA fragments were transferred into a S. lividans strain TK24. Altogether about 20 
20 kb (=kilobase, 1000 bases) were transferred in five fragments into a foreign host. Of 
these an about 12 kb DNA fragment, pSY15, causes the production of nogalamycin 
intermediates in S, lividans. The recombinant strain obtained was cultivated in a 
nutrient medium used for anthracycline producers and the product was extracted by 
suitable organic solvents. 

25 

DNA fragments according to the invention were transferred into Streptomyces strains 
described hereinafter as well as to 5. galilaeus mutants H028, JH003, H061, H036 and 
H039 given in Table 2, and expressed in them. Said DNA fragments can corres- 
pondingly be transferred to other mutants mentioned in the table, depending on what 
30 kind of products are desired. 
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Streptomyces lividans 66, strain TK24, rcstriction-modification-frce strain. 

Streptomyces galilaeus ATCC 31615, produces aklacinomycin. 

Mutants of Streptomyces galilaeus ATCC 31615 (cf . Table 2) (Ylihonko et aL, 1994). 

5 Table 2. The products of Streptomyces galilaeus mutants; abbreviations used: 
Akn=akIavinonc, aglycone moiety of aclacinomycins; Rhn=rhodosamine; 
dF=dcoxyfucose, CinA=Cincrulosc A; Rho=rhodinose 





Mutant 


Product 


Description of mutation 


10 


H028 


No produoioo 


Mutation in PKS-domain 




JH003 


Ho pfoductioo 


Mutation in PKS-domain 




H061 


2-OH-Akluionc add 


No removal of 2-OH 




H036 


Methyl ester of 
akliaosc acid 


The fourth ring does not get dosed 




H039 


IVVIdanriDaoe 
2)Aka-Rho-Rbo 


Amino sugar is missing. 


15 


H038 


AkB-Rbn 


Mutation in glycosylation 




H026 


Aks-Rhn-dF-Rbo 


Oxidoxcductase is missing 




H035 


Not identified 


Mutation in the glycosylation 




H054 


lyOm-Rho-dF-CittA 

2) Alm-dF-dF-CiiiA 

3) Altn-Rbo-dF-Rho 

4) Ato-Rho-dF 

5) Akn-dF-dF 


Amino sugar is missing 



20 When producing the starting product for biotransformation the host used is preferably 
S. lividans, because it does not itself produce coloured or cxtractable compounds in the 
growth conditions used. 

When producing an aglycone for biotransformation the bacterial strains producing 
25 anthracyclines or non-producing mutants thereof are preferably used, most preferably 
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non-producing mutants of 5. galilaeus being transformed with piasmid pSY15 (Fig. 3), 
carrying the above mentioned 12 kb DNA fragment. 

When converting the anthracyciine precursors obtained using the piasmid pSY15 to 
5 anthracyclines or their aglycones, 5. galilaeus mutants, e.g. strains JH003 or H028, 
which do not produce aclanibicin are preferably used. 



The DNA-constnictions according to the present invention can be constructed by 
ligating suitable DNA fragments from the domain as described to a suitable vector. 
10 Such a vector is preferably the high copy number piasmid pU486 capable to amplify 
in several strains of the genus Streptomyces (Ward et aL, 1986). 

To produce anthracyclines and their precursors strains carrying the pSYlS piasmid are 
grown preferably in growth media for Streptomyces bacteria, preferably in El- 

15 medium, to which thiostrepton has been added to maintain the piasmid carrying 
strains. The strains arc grown in conditions which are advantageous to the producing 
strain, e.g. in a shaker in bottles, or in a fermenter which is stirred and aerated. After 
a suitable cultivation time, preferably after 2-7 days the products are isolated 
according to methods described for bacterial metabolites, preferably e.g. extracting 

20 with a suitable solvent, e.g. toluene or chloroform. The extracted compounds arc 
purified with suitable methods e.g. by using column chromatography. 



Anthracyciine precursors arc converted to anthracyclines in strains naturally producing 
anthracyclines, or mutants thereof. Compounds similar to those naturally produced by 

25 the strain arc thus obtained, having methyl in their 9-position and hydrogen in their 
2-position. In biotransformations auramycinone produced by a 5. galilaeus strain 
carrying the piasmid pSY15 is most suitably used as the starting compound, or methyl 
ester of nogalonic acid produced by a strain carrying the same piasmid which naturally 
does not produce anthracyclines. In biotransformations most preferably non-producing 

30 mutants of anthracyciine production strains are used, e.g. mutant H028 or JH003. 
Biotransformation is effected most preferably by cultivating a strain in a suitable liquid 
production medium, e.g. in El -medium, and by addmg anthracyciine precursors in 
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suitable amounts. After a suitable time, e.g. 6 to 48 hours, most preferably 16 to 32 
hours, the anthracyclines so formed are extracted. 

The strains used for transformation (cf. also Table 2) are described in the following. 

5 

TK24 is a 5. lividans strain which in the growth conditions used does not produce 
coloured secondary metabolites. In other growth conditions it produces actinorhodin, 
which is an antibiotic differing very much from anthracyclines. The strain does not 
produce any anthracyclines nor their precursors. When characterizing the products of 
10 TK24/pSY15 on the basis of NMR-spcctrum compound I was obtained as the primary 
product, which is possibly an intermediate of anthracycline biosynthesis (cf. Scheme 

I). 

H028 is a mutant of Streptomyces galilaeus which docs not as such produce 
15 anthracyclines or their precursors. However, this strain can be used in biotransforma- 
tions to convert anthracycline precursors to products similar to aclarubicin. When 
characterizing H028/pSY15 products it was found that this strain produces auramy- 
cinone (Compound H), which is an anthracycline aglycone similar to aklavinone, as 
well as auraraycins which are glycosides of auramycinone, e.g. Compound HI. When 
20 hydrolyzing auramycins auramycinone is obtained, which also shows that the 
compounds produced are glycosides of auramycinone. Auramycinone is a useful 
precursor of anthracyclines, when new anthracyclines are produced by biotransformati- 
on. Auramycins have been described to be cytostatic anthracyclines having possible 
use in cancer chemotherapy. The use of H028/pSY15 for the production of these is 
25 new. 

H061 is a Streptomyces galilaeus mutant, which produces 2-OH-aklanonc add. This 
is evidently due to a mutation which prevents removal of the hydroxyl in 2-position. 
H061/pSY15 produces aklavinone, auramycinone and their glycosides similar to 
30 aclarubicin. According to the result pSYlS complements the mutation of H061 and 
comprises thus the gene encoding the 2-position dehydroxylase. This is useful in 
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producing new hybrid compounds when tiansfonncd to a strain the products of which 
naturally have hydroxyl or a mcthoxy group in 2-position. 

On the basis of the results pSY15 is useful in producing precursors of anthracyclines 
5 in strains which naturally do riot produce anthracyclines, or when producing hybrid 
anthracyclines in strains which produce anthracyclines, or in mutants thereof. With it 
the formation of 9-position side chain can be affected so that the strains which 
provide a two carbon side chain at this position, do produce compounds which have 
a one carbon side chain at said position. Possible strains producing anthracyclines 

10 which can be modified this way are e.g. S, galilaeus, 5L peucetius and i purpurascens. 
The anthracyclinc precursors produced this way are useful in producing new 
anthracyclines by biotransformations. pSYlS can also be transferred to a strain which 
normally produces compounds which at 2-position have hydroxyl or a methoxy group. 
Thereby compounds are obtained which have hydrogen at this position. pSY15 enables 

15 also one to produce previously described auramycmone and its glycosides by the new 
method. 



In the following the detailed embodiments of the invention arc described as examples 
of isolation of the DNA fragment from S, nogalater strain ATCC 27451, production 
20 of nogalamycin precursors in 51 lividans strain TK24, production of auramycinonc in 
the mutant H028 and their modification to anthracyclines in the mutant JH003. In 
addition, expression of the DNA fragments according to the invention in the mutants 
of the strain S, galilaeus is described, as well as the compounds produced by these 
strains. 

25 

The main products of the strains TK24/pSY15, JH003/pSY15, H028/pSY15 and 
H061/pSY15 were characterized. 
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Brief description of drawings 

Fig. lA Anthracyclincs produced by Streptomyces strains, and identified precursors 
thereof. (Starting molecule: propionate.) The numbers of S. galilaeus 
5 mutant strains producing the intermediates are given in parentheses. 

Fig. IB Anthracyclincs produced by Streptomyces strains having acetate as the 
starting molecule. 

10 Fig. 2 Restriction map of the 12 kb continuous DNA fragment cloned from S. 

nogalater genome. The figure discloses also the inserts contained in the 
pSY plasmids obtained. Plasmid pU486 has been used in preparing the 
pSY vectors. On the basis of sequence comparisons the following functions 
have been obtained for the open reading frames shown in the figure: 1 = 

15 ketoacylsynthase-acyltransfcrase. 2 = Chain Length Controlling Factor 

(CLF)» 3 = acyl transferring protein; A and B = regulatory genes, C = 
mono-oxygenasc, D = methyl transferase, E = ketorcductase. 



20 



Fig. 3 Structure of the plasmid pSY15. 

Fig. 4 NMR-spectrum of compound I. 

Fig. 5 NMR-spcctrum of auramycinone. 

25 Fig. 6 NMR-spcctrum of auramycinone-rhodosamine-deoxyfucose. 

Fig. 7 NMR-spectrum of auramycmonc-rhodinose-deoxyfucose. 
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Materials used 

Bacterial strains and plasmids 

The strain Streptomyces nogalater ATCC 27451 was used as the donor of genes. The 
5 Streptomyces bacterial strains used in this work as hosts are listed above. The 
treatments of 5. nogalater DNA were effected in the E. coli strain XLl-Blue (recAl, 
endAl, gyrA96, thi-1, hsdRll, supE44. re/Al, lac, [FproAB, laclZAMlS. TdIO (ttf)] 
(Stratagcne Qoning Systems, California). £. coli strains GM2163 (£. coli Genetic 
Stock Center, Department of Biology 255 OML, Yale Univcisity, New Haven, USA) 
10 and LE392 (Promega) were used in preparing the gene bank and m amplifying the 
phage DNA. 

In K coli the plasmids pUC18/pUC19 (Pharmacia Biotech) were used, and in Strepto- 
myces strains the plasmid pU486 was used (Ward et al., 1987; obtained from Prof. 
15 Hopwood. John Inncs Centre, UK). 

Nutrient media and solutions used 

TRYPTOlvfE-SOYA BROTH (TSB) 
20 Per litre: Oxoid Tryptone Soya Broth powder 30 g. 

YEME (Hopwood et al^ 1985., p. 239) 

Per litre: Yeast extract (Difco) 3 g, Bacto-peptone (Difco) 5 g, malt extract (Oxoid) 
3 g, glucose 10 g and saccharose 340 g. After autodaving 2 ml of sterile 2JM MgClj 
25 solution and 25 ml of 20% glycine arc added. 

SGYEME As YEME, but the amount of saccharose was 110 g per litre. To prepare 
protoplasts the amount of 20% glycine varies from 12 ml to 50 ml per litre depending 
on the strain used. 

30 

YM-agar Bacto Yeast malt extract agar, ISP-mcdium 2, Difco; 38 g/litre. 
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ISP4 



Bacto ISP-medium 4, Difco; 37 g/litrc. 



R2YE 



Hopwood €t al,, (1985 p. 236) 



5 



LB 



Sambrook et aL, (1989, 3:A.l) 



El 



Per litre: Glucose 20 g, starch 20 g, Farmamcdia 5 g, yeast extract 2.5 g, 



KjHPO^* SHjO 1.3 g, MgSO^^ THjO 1 & NaQ 3 g, CaCX), 3 g. Tap water is added 
to 1 litre and pH is adjusted to 7.4. 



20*SSC Per litre: NaQ 1753 g, Na-citrate 88.2 g. pH is adjusted to 7 with NaOH. 

15 DENHARDT SOLUTION (Sambrook et al, 1989, 3:B.l) 

A 50* basic solution is prepared, which contains Ficoll 5 g, polyvinyl pyrrolidone 5 
g, BSA (bovine serum albumin) 5 g. Distilled water is added to 500 ml and sterilized 
by filtrating. 

20 

Example 1. Qoning and cbaracterizatioD of the genes included in the 
anthracydine biosynthesis of Streptomyces nogataier 

11 Preparing of gene bank and cloning of anthracydine genes from S. 
25 nogalater. 

Isolation of the total DNA from Streptomyces nogalater 

5. nogalater (ATCC 27451) mycclia were cultivated for about 3 days in 50 ml of TSB 
medium, wherein 0.5% glycine had been added at 28 ^^C vigorously shaking. The 
30 mycelia were pelleted and the supernatant was discarded. The pellet was suspended 
into 10 ml of lysis buffer (15 % saccharose, 25 mM Tris, pH 8,0, 25 mM EDTA and 
5 mg/ml of lysozyme) and incubated for 15 min at 37 ""C. 1 mg of proteinase K and 



10 



TE 



Tris-HCl-buffcr, pH 8: 10 mM, EDTA, pH 8: 1 mM 
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1 ml of 10% SDS were added while stiiring. The mixture was incubated at once for 
15 min at 70 °C. The lyscd pellet was subsequently cooled in ice, 1 ml of 3 M Na- 
acetate (pH 6.0) was added and kept for a few minutes on ice bath. 5 ml of phenol 
balanced with 0.1 M Tris was added and stirred by turning the tube around. The 
5 phases were scntrifuged apart and the water phase was further extracted with 5 mi of 
chloroform. DNA was subsequently precipitated by adding 10 ml of isopropanol. DNA 
was spumed cautiously around a Pasteur pipette being closed by flaming, washed by 
dipping into 70% ethanoi and DNA was loosened onto the wall of the tube. DNA was 
dissolved in 5 ml of TE-buffer and treated with RNase (25 jil of 10 mg/ml DNase 
10 free RNase) for about 30 min at 37 **C. Tht phenol and chloroform extractions were 
repeated. DNA was subsequently rcprccipitated with isopropanol and washed as above. 
Finally DNA was dissolved in 1 ml of TE-buffer and it was used for subsequent steps. 

Southern hybridization 

15 The aal probe was the 0.8 kb 5g/II-fragment obtained from the plasmid pU2345 and 
the acm probe the 3 kb Ai/nHI-fragmcnt obtained from the plasmid pACM5 (Nicrai 
et aL, 1994). The plasmids were isolated at mini-scale (Magic Miniprcps reagent 
scries of Promega) and the probe fragments were isolated by preparative agarose gel 
electrophoresis after digestmg them first withBg/II and with Ai/nHI, respectively. The 

20 probes were then labeled with 50 |iCi of [a^-P]CTP by nick-translation (Nick 
translation labeling reagent scries of Boehringer Mannheim). 

The total DNA preparations isolated as described above were digested with EcdRl 
enzyme and fractionated with agarose gel electrophoresis. The fractionated DNA was 
25 transferred from the gel to Hybond N membrane (Amersham) using the Vacugcne 
apparatus (LKB 2016. Phaimacia LKB Biotechnology) accordmg to the instructions of 
use. DNA was fastened into the membrane by incubating for 3 min in UV light. 

The membranes were hybridized in 10 ml of hybridization solution (1% SDS» IM 
30 NaQ, 5* Denhardt's solution, 100 jig/ml denatured carrier DNA (DNA from calf 
thymus, Boehringer Mannheim) at 65 **C in a hybridization oven (HB-ID Hybridiser, 
Techne) for about 6 h, whereafter at least 100 ng of labeled probc-DNA was added 
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into the hybridization tube and the inkubation was continued for further about 12 h. 
After this the membranes were washed at 65 ''C for 2''30 min in a wash solution 
(2*SSC, 1% SDS or 0.2*SSC. 0.1% SDS). Autoradiography was effected by 
superimposing the membrane coated with a plastic Sim and the autoradiography film. 
5 Exposure lasted about 1 to 3 days. 

Preparing of the gene bank from S. nogalater DNA 

40 ^ig of DNA was incubated in the digestion buffer (10* A, Bochringer Mannheim) 
in the presence of 2,4 units of Sau3A (Boehringer Mannheim) for 5 min at 37 "C and 

10 the reaction was stopped by adding phenol. After phenol treatment DNA was purified 
with ethanol precipitation. DNA-fragments so obtained were run at preparative agarose 
gel electrophoresis (0.3 % LGT, low gelling temperature). DNA, which was 20 kb or 
bigger, was taken from the gel by cutting and purified by phenolization from the 
agarose. A commercial phage vector, X EMBL 4, BamlU fragments (Amersham Inter- 

15 national pic, Amersham UK) were treated with alkaline phosphatase (CIAP. calf 
intestinal alkaline phosphatase, Promega) according to the instructions of the 
manufacturer. The msert DNA {Sau3A fraction) and vector so obtained were ligatcd 
by incubating for 2 h at room temperature and for 2 h at 14 in the presence of T4- 
DNA ligase (Promega) according to the reconmiendation of the manufacturer. The 

20 ligation mixture was packed to X-particlcs using the Packagene reagent scries 
(Promega Biotech) according to the manufacturer's instructions. Escherichia coli strain 
GM2163 was used as the host. The cells were prepared for infection according to the 
packing instructions and cells infected with the packing mixture were spread onto 
plates according to Promega's instructions. 

25 

Isolation and mapping of hybridizing denes 

Phage DNA from plates with about 4000 plaques/plate was transferred to a membrane 
(Colony/Plaque Screen, New England Nuclear) according to the manufacturer's 
instructions. The membranes were hybridized as described above. Plaques which gave 
30 a signal in autoradiography, were picked up and the phages were eluted from them by 
incubating a plaque in 0.5 ml of SM-buffer for 2 houis. Because the plaque plates 
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were dense, the plaques were purified by infecting them into the host strain LE392 
(Promega) and hybridizing as above. 

From the purified clones phage DNA was prepared in 20 ml scale by infectmg the 
5 LE392 cells according to Promega's packing instructions. The DNA so obtained was 
digested with various restriction endonucleases to map the clones (Sambrook et aL, 
1989) and by hybridizing with different probes. The restriction map so obtained is 
given in Fig. 2. 



10 Transfer of the DNA fragments to S. Uvidans and detection of new compounds 

The fragment shown in the restriction map (Fig. 2) was transferred into S. Uvidans as 
£coRI-fragments (pSYl and pSY6) or as a fig/H-fragment (pSY15). X-clones were 
digested with EcoRl or 5f /Il-restriction enzyme and ligated to a plasmid made linear 
with the same enzyme and was transfonned by electroporation into coli or by 

15 protoplast transformation into S, Uvidans. Most of the inserts were first cloned into the 
plasmid pUC19 amplifying in E. coU, whereby as a host K coU strain XLl-Bluc vras 
used. pSY15 was cloned directly into the 5. Uvidans strain TK24. £ coU was used 
because by that way smaller amounts of phage-DNA could be used. The transformati- 
on efficacy of £ coU was 2*^10* transformants/jig DNA, when £. coU Pulscr 

20 Apparatus-clcctroporation device (Bio-Rad) was used with the following settings (200 
Ohm, 25 ^F, 1.4 kV). For electroporation the cells were treated as described in Dower, 
WJ. et qL (1988), and 0.1 cm cuvettes of Bio-Rad were used in transformation, the 
cell volume was 20 fj. 



25 X Uvidans strain TK24 was used as an intermediate host as the expression was 
believed to be successful only in X gaUlaeus strains. S, gaUlaeus is not at all 
transformable with DNA propagated in £ coU, Only the plasmid pSY15 caused 
modification in TK24 strain, which was noticed as brown colour on the ISP4 plate, 
when TK24 is nonnally rather colouriess or blue. Only the TK24 strain carrying the 

30 plasmid pSY15 caused formation of coloured products in the El-mcdium well suited 
for the production of anthracyclines. On the basis of thin layer chromatography the 
products of the recombinant strain TK24/pSY15 seemed to be alike to but not identical 



WEST 



wo 96/10581 



PCT/FI95rt)0537 



16 

with those produced by the mutant H036 (Ylihonko et al, 1994) producing the methyl 
ester of aklanone acid. With the cluent toluenexthyl acctatc:mcthanoi:formic acid 
(50:50:15:3) the following R,-values were obtained for these products; 

5 TK24/pSY15: 0.66; 0.60; 0^0 
H036: 0.67; 0.62; 0.51. 

These characteristics were confirmed to come from the pSY15 plasmid by rctransfor- 
ming the plasmid to 5. lividans TK24 strain. The transformants so obtained were also 
10 able to produce anthracyclinc precursors. When the recombinant strain was cultivated 
in El medium without selection pressure of the plasmid strain caused by thiostrcpton, 
the production of new compounds decreased. 

12 Localizing the PKS-genes 

15 

Sequencing of the hybridizing fragment 

From the £coRI-digest a 2 kb acti hybridizing fragment was obtained and it was 
sequenced. About 2 kb of DNA to the right according to the map (Fig. 2) was 
additionally sequenced. For sequencing 31 clones were prepared from restriction 
20 enzyme digestion sites to the vectors pUC18 and pUC19, being linearized with 
corresponding enzymes. 

To isolate plasmids for the sequencing reactions Magic/Wizard™ Minipreps DNA 
Purification System kit of Promega was used. E, coli XLl-Blue cells were cultivated 
25 overnight in 3 ml of LB-medium which contained 50 ^ig/ml of ampicillin, and the 
plasmids were isolated according to the manufacturer's instructions. 

DNA-sequencing was performed by using dideoxy chain termination method. For the 
sequencing reactions Deaza G/A ""Sequencing™ Mixes (Pharmacia) and TaqTrack* 
30 Sequencing Systems, Deaza (Promega) sequencing reagent series were used. 
Denaturation was always performed according to the instructions in the Pharmacia kit, 
(Method Q. When using the Phamiacia kit the primers were ligated according to the 
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Method C given in the instructions (Standard Annealing of Primer to Double-Stranded 
Template). When using the Promega kit the item "Sequencing Protocol Using Direct 
Incoiporation" of the manufacturer's instructions was followed. Deviating from the 
primer ligation temperature (37 recommended by the manufacturer the temperature 
5 of 45 **C was used to avoid the secondary structures caused by the high GC-content. 
The temperature was kept thereafter at 45 °C until the end of the reaction. As a 
radioactive label [a«S]dATP (NEN Products Boston, MA) was used. Most of the 
PKS-domain was sequenced with a universal primer (5'-d(GTTTTCCCAGTCAC- 
GACy-y) and with a reverse primer (5'-d(CAGGAAAC:AGCrATGAq-3* (pUCVMlS 
10 17*mcr Primers, Promega). When sequencing the longest fragments (500-600 bp) of 
the domain, and in order to define the sequences of such restriction sites which could 
not be "passed", six specific primers were used. The primers were prepared at the 
Department of Bioorganic Chemistry in the University of Turku. 

15 The sequencing gels were run by the Macrophor-systcm of Pharmacia, using a 4% 
thickness gradient gel. Running conditions: current 20 mA, voltage 2500 V. 

Sequence analysis 

From the PKS domain the DNA fragment with about 4134 bases (as given in the 
20 sequence listing) was sequenced, the analysis of which was performed by GCG- 
softwarc (Genetics Computer Group, GCG Package, Wisconsin USA). With the 
subprogram CODONPREFERENCE the open reading frames were sought from the 
sequence. The reading frames obtained were translated to the amino acid sequence and 
with the TFASTA-subprogram homologies to known sequences were sought. 

25 

According to the CODONPREFERENCE program the 4134 base DNA fragment as 
sequenced had altogether three open reading frames (ORFl, ORF2, ORF3) (ORF 1 is 
the fragment 359-1651 in SEQ ID N0:1 of the sequence listing, ORF2 is the fragment 
1648-2877 in the SEQ ID N0:4, and 0RF3 is the fragment 2937-3197 in the SEQ ID 
30 NO:l). In the beginning of each open reading frame a possible ribosome binding site 
was found (RBS). The functions of the genes were concluded by comparing the amino 
acid sequences translated from their base sequences to known sequences. So the 
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following similarities with the open reading frames of actinorhodin and tetraccnomycin 
PKS domains were obtained: ORFl (80%, 81%), 0RF2 (74%, 77%), 0RF3 (62%, 
62%), and on the basis of this we present the following functions to said genes: ORFl 
is ketoacylsynthase; ORF2 is the factor which effects on the chain length; 0RF3 is an 
5 acyl carrier protein. These three open reading frames are needed for a functional 
polyketide synthase. 

Upstream of the PKS domain about 6 kb DNA fragment was sequenced (kb = 1000 
bases). In this domain the following gene activities have been recognized on the basis 
10 of the sequence: (Fig, 2): regulatory genes, mono-oxygcnase, methyl transferase and 
ketoreductase. 

Example 2. Transfer of the genes into the strain S, galilaeus ATCC 31615 
and mutants thereof 

Flasmid pSY15 was isolated from 5: lividans strain TK24 and transformed into S, 
galilaeus mutant H039 and the DNA isolated therefrom further into other S. galilaeus 
mutants. The method used in the transformation of the S, galilaeus strain being 
modified from the transformation method used in the transformation of 5. lividans has 
been described earlier (Ylihonko, PL, Pro gradu-thesis. University of TXirku, 1986). 
For preparing protoplasts the cells were grown in SGYEME, to which 0.8% 
saccharose had been added. The plasmids were transformed successfully first to the 
mutant H039, whereby with 2 \ig of plasmid-DNA about 10 transformants were ob- 
tained. Because of a strong restriction barrier S. galilaeus is weakly transformable with 
foreign DNA but the transformation efficacy increases manyfold if the plasmid has 
been isolated from a 5. galilaeus strain. 

H039-transformants were first cultivated for about 5 days on an ISP4 plate, whereto 
thiostreptone had been added. The mycelium was inoculated in 50 ml of TSB nutrient 
30 broth (5 (ig/ml of thiostreptone added) and grown in a shaker for 5 days. The plasmid 
was isolated as described above and transformed into other mutants. Usually 200 to 
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500 ng of plasmid was used per one transformation, whereby 10 to 100 transformants 
were obtained. 

After regeneration the transformed mutant strains were spread onto ISP4 plates, 
5 whercfrom the mycelium was further transferred to El nutrient medium. To retain the 
plasmid thiostrepton was added to all nutrient media. El mycelium was incubated in 
a shaker (330 rpm, 30 ""Q and production was followed by taking after 3 days a 0.5 
ml sample of the mycelium daily for 3 to 5 days. The sample was buffered to pH 7 
with phosphate buffer and extracted with methanol-toluene mixture (1:1). In addition, 
10 part of the samples were acidified with IM HQ solution and extracted into tolucne- 
mcthanol. In El-cultivations both mutants and the 5. galilaeus wild strain were used 
as controls. By comparing the products on TLC the effects of the plasmid on the 
production were seen. 

15 The 5. galilaeus mutants used in transformations are listed above. Plasmid pSY15 
complemented, i.e. restored the producing ability of anthracyclincs or precursors 
thereof in the following mutants: H028, H061 and JH003. It did not affect the 
production profile of the mutants H036 and H039 to any appreciable extent. JH003, 
which does not produce coloured compounds in the conditions used, has been mutated 

20 fi^om the strain H054 and the transformant JH003/pSY15 was compared to the strain 
H054. H028 is also a non-producing mutant, which was obtained by mutating the wild 
strain 5. gaUlaeus ATCC 31615. So the wM strain was used as the control of the 
transformant H028/pSY15. Using the eluent toluenercthyl acetatermethanohformicacid 
(50:50:15:3) the following Revalues were obtained for the transformants and the host 

25 strains used as controls. 

H028/pSY15: (0.69); QM; 0.58; 001 
JH003/pSY15: 0.59; 0,50; 0.46; 031 
H061/pSY15: (0.69); 0.61; 05&; QM; QM 
30 5. galilaeus ATCC 31615: 0.23; 0.14; 0.11 
H054: 0.65; 0.60; 0.53; 0.48. 
H061: 050 (acid). 
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The product isolated in small scale was hydrolyzed by heating in IM hydrochloric acid 
at 80 for 0^ h. After hydrolysis the following R^-valucs were obtained for the 
aglycons or precursors thereof: 
H028/pSY15: 0.61 
5 JH003/pSY15: 0.61 
H061/pSY15: 0,61. 

Because all these mutants used have originally been produced from a S, galilaeus wild 
strain, aklavinone was used as comparison, being the aglyconc of aclacinomycins 
10 produced by S. galilaeus. In the clucnt used the Revalue 0.69 was obtained for 
aklavinone. In the products of transformants small amounts of aklavinone were also 
detected. 

Example 3. Production of anthracycllne precursors 

15 

3.1 Production of TK24/pSYlS product 

Ten 250 ml crlcnmeyer-flasks each containing 60 ml of El-mcdium were inoculated 
with 1 ml aliquots of the strain TK24/pSY15. The flasks were incubated in a shaker 
20 at 330 rpm at the temperature of 30 ^^C for about 3 days. From the finished mycclia 
production was confirmed by extracting 0.5 ml samples with a mixture of methanol 
and toluene (1:1). The products were compared to the standard by thin layer 
chromatography. 

25 The flasks were emptied into two 400 ml centrifuge tubes and centrifuged for 10 min 
at 3000 rpm. The supernatant was recovered. The precipitate was suspended by adding 
to each tube 50 ml of methanol. The tubes were recentrifuged for 10 min at 3000 rpm. 
Thz methanol solution was added into the supernatant. The precipitate was discarded. 
The solution was extracted with 2 • 100 ml of chloroform, whereby a strongly 

30 orange-yellow chloroform solution was obtained. The water phase was discarded. 



WEST 



wo 9^10581 



PCT/FI95/00537 



21 

Chlorofonn was evaporated on a water bath in a rotary evaporator. The orange- 
yellow, diy product was dissolved in 2 ml of chloroform. 

Tlie chloroform solutions were pipetted into a chromatography column of glass, 
5 equipped with a glass sinter, having a diameter of 2 cm and containing about 5 cm of 
silica suspended in chloroform (Kiesclgel 60, Merck). TTie column was eluted with 2 J 
ml aliquots of chloroform. Each fraction was collected into a separate test tube. 
Samples of each fraction were dropped on a thin layer and compared to the standards. 
Fractions containing individual compounds were pooled and evaporated into dryness. 

10 

NMR-spcctra of pure compounds were determined and the compounds were identified 
by comparing the spectra with analogical compouiids. In Fig. 4 the H-NMR-spcctrum 
of Compound I is given. 

15 32 Production of an agiycone in the strain H028/pSY15 

Ten 250 ml crienmcycr-flasks each containing 60 ml of El-medium were inoculated 
with 1 ml aliquots of the strain H028/pSY15. The flasks were incubated in a shaker at 
330 rpm at the temperature of 30 X for about 4 days. From the finished mycclia 
20 production was confirmed by extracting 0.5 ml samples with a mixture of methanol 
and toluene (1:1). The products were compared to the standards by thin layer chroma- 
tography. 

The flasks were emptied into two 400 ml centrifuge tubes and ccntrifugcd for 10 min 
25 at 3000 rpm. The supernatant vras recovered. The precipitate was suspended by adding 
to each tube 50 ml of methanol. The tubes were reccntrifuged for 10 min at 3000 rpm. 
The methanol solution was added into the supernatant. The precipitate was discarded. 
The solution was extracted with 2 ♦ 100 ml of chloroform, whereby a strongly yellow 
chlorofonn solution was obtained. TTic water phase was discarded, 

30 

Chloroform was evaporated on a water bath in a rotary evaporator. Hie yellow, dry 
product was dissolved in 2 ml of chloroform. 
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The chloroform solutions were pipetted into a chromatography column of glass, 
equipped with a glass sinter, having a diameter of 2 cm and containing about 5 cm of 
silica suspended in chloroform (Kiesclgel 60, Merck). The column was cluted with 2.5 
ml aliquots of chloroform. Each fraction was collected into a separate test tube. 
5 Samples of each fraction wwc dropped on a thin layer and compared to the standards. 
Fractions containing individual compounds were pooled and evaporated into dryness. 

NMR-spcctra of pure compounds were determined and the compounds were identified 
by comparing the spectra with analogical compounds. In Fig. 5 the H-NMR-spectrum 
10 of auramycinonc (Compound H) is given. 

Example 4. Biotransformation of hybrid products 

4.1 Biotransrormation of auramycinone in strain JH003 

15 

A 250 ml crlenmeycr-flask containing 60 ml of El-medium was inoculated with 1 ml 
of stram JH003. The flask was incubated in a shaker at 330 rpm at the temperature of 
30 for about 3 days. After two day's cultivation about 2 mg of auramycinonc was 
added into the flask. At 24 houis from this the production was confirmed by extracting 
20 a Q5 ml sample with the mixture of methanol and toluene (1:1). The products were 
compared to the standard by thin layer chromatography. 

The flask was emptied into two 60 ml centrifuge tube and ccntrifugcd for 10 min at 
3000 rpm. Hie supernatant was recovered. The precipitate was suspended by adding 
25 to each tube 10 ml of methanol. The tubes were rccentrifiigcd for 10 min at 3000 rpm. 
The methanol solution was added to the supernatant. The precipitate was discarded. 
The pooled solution was extracted with 2 * 20 ml of chloroform, whereby a strongly 
yellow chloroform solution was obtained. The water phase was discarded. 

30 Chloroform was evaporated on a water bath in a rotary evaporator. The yellow, dry 
product was dissolved in chloroform. On the basis of TIX the product was found to 
correspond to the products of the strain JH003/pSY15 (cf. Example 5.2). 
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Example 5. Production of hybrid anthracyclines 

5.1 Production of auramycinone-rbodosamine-deoxyfucose in strain 
H028/pSY15 

Ten 250 ml erlenmeyei-flasks each containing 60 ml of £1 -medium were inoculated 
with 1 ml aliquots of the strain H028/pSY15. The flasks were incubated in a shaker at 
330 ipm at the temperature of 30 for about 4 days. From the finished mycelia 
production was confirmed by extracting 0.S ml samples with a mixture of methanol 
and toluene (1:1). The products were compared to the standard by thin layer chroma- 
tography. 

The flasks were emptied into two 400 ml centrifuge tubes and centrifuged for 10 min 
at 3000 rpm. The supernatant was recovered. Tlie precipitate was suspended by adding 
to each tube 50 ml of methanol. The tubes were rccentrifuged for 10 min at 3000 rpm. 
The methanol solution was added into the supernatant. The precipitate was discarded. 
The pooled solution was extracted with 2 * 100 ml of chloroform, whereby a strongly 
yellow chloroform solution was obtained. Tlie water phase was discarded. 

Chloroform was evaporated on a water bath in a rotary evaporator. The yellow, dry 
product was dissolved in 2 ml of chloroform. 

The chloroform solutions were pipetted into a chromatography column of glass, 
equipped with a glass sinter, having a diameter of 2 cm and containing about 5 cm of 
silica suspended in chloroform (Kiesclgel 60, Merck). Tlie column was elutcd with 23 
ml aliquots of chloroform. Each fraction was collected into a separate test tube. 
Samples of each fraction were dropped on a thin layer and compared to the standards. 
Fractions containing individual compounds were pooled. 

NMR-spectia of pure compounds were determined and the compounds were identified 
by comparing the spectra with analogical compounds. In Fig. 6 the H-NMR-spectrum 
of auramycinone-rhodosaminc-deoxyfucose (Compound HI) is given. 
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S2 Production of auratnycinone-rhodinose-deoxyfucose in strain JH003/pS Y15 

Ten 250 ml erlenmeyer-flasks each containing 60 mi of El -medium were inoculated 
with 1 ml aliquots of the strain JH003/pSYlS. The flasks were incubated in a shaker 
5 at 330 ipm at the temperature of 30 for about 4 days. From the finished mycelia 
production was confirmed by extracting 0.S ml samples with a mixture of methanol 
and toluene (1:1). The products were compared to the standard by thin layer chroma- 
tography. 

10 The flasks were emptied into two 400 ml centrifuge tubes and centrifuged for 10 min 
at 3000 rpm. The supernatant was recovered. The precipitate was suspended by adding 
to each tube 50 ml of methanol. The tubes were recentrifuged for 10 min at 3000 rpm. 
The methanol solution was added into the supernatant. Tlie precipitate was discarded. 
The pooled solution was extracted with 2 * 100 ml of chloroform, whereby a strongly 

15 yellow chloroform solution was obtained. The water phase was discarded. 

Chloroform was evaporated on a water bath in a rotary evaporator. The yellow, dry 
product was dissolved in 2 ml of chloroform. 

20 The chloroform solutions were pipetted into a chromatography column of glass, 
equipped with a glass sinter, having a diameter of 2 cm and containing about 5 cm of 
silica suspended in chloroform (Kieselgel 60, Merck). The column was eluted with 23 
ml aliquots of chloroform:metbanol 100:10. Each fraction was collected into a separate 
test tube. Samples of each fraction were dropped on a thin layer and compared to a 

25 standard. Fractions containing individual compounds were pooled and evaporated into 
dryness. 

NMR-spectra of pure compounds were determined and the compounds were identified 
by comparing the spectra with analogical compounds. In Fig. 7 the H-NMR-spectrum 
30 of auramycinone-rhodinosc-deoxyfucose is given. 



WEST 



wo 96/10581 



PCT/FI»5rt)0537 



25 

Example 6, CbaracterizatioD of the products 
6.1 HPLC-nins 

The retention times of the compounds were detexmined at RP-18-column, with an 
5 cluent acetonitrile:mcthanol:potassium dihydrogen phosphate buffer (8.00 g/1, pH 3.0) 
5:2:3. The retention times of the compounds are: I: 4.63, H: 3.52, HI: 4.09 and IV: 
7.26. The structures of the compounds I - IV arc given in the Scheme I. 

62 NMR-spectra of the compounds 

10 H-NMR-spectra of some of the TK24/pSY15, H028/pSY15 and JH003/pSY15 
products were determined by Bruker 400 MHz NMR spectrometer in deuterium - 
chloroform. The spectra given by the compounds were compared to the spectra of 
known compounds, e.g. aclarubidn. The spectra obtained are given in Figs. 4 to 7. 

IS In all of the compounds the hydrogens in 1, 2 and 3-positions bound to each other and 
with same transitions are found. The singlet corresponding to the hydrogen in 11- 
position was found in ail compounds with the same transition. Additionally, the peaks 
given by the two aromatic bydroxyls can be seen. On the basis of the peaks of these 
six hydrogens the aromatic chromophore moieties are similar, and correspond e.g. the 

20 chromophore of aklavinone. 

In all of the compounds a singlet of the size of three hydrogens is found at about 3.7 
ppm corresponding to the methyl of methyl ester. Another singlet is found in all 
compounds at about 3.8 ppm, which corresponds to the 10-position hydrogen. The 
25 integral of this is of the size of one hydrogen in auramydnone and its glycosides and 
of the size of two hydrogens in Compound I. According to this Compound I suits to 
be a compound in which the fourth ring has not been closed. 

The region 4.7 to 6 ppm has in anthracyclines and in compounds related thereto 
30 hydrogens at 7-position and 1-position of the sugars. Auramydnone has in this region 
one peak, Compounds HI and IV have three peaks^ but in Compound I there are no 
peaks in this region. According to this auramycinone has no sugars and Compounds 
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III and rv have two sugars, whereas Compound I has no hydrogens in this region 
which suits with the kcto-form at position 7, 

Auramycinone and its glycosides have a three hydrogen singlet between 1.39 and 1.47 
5 ppm. This suits to be the methyl group of position 13, which is not bound to other 
hydrogens. This item distinguishes these compounds from aklavinone and its 
glycosides, wherein the side chain is ethyl. 

The 8-position CH2-hydrogcns of auramycinone and its glycosides give one doublet 
10 at 2.2 ppm and a double doublet at 2.6 ppm. In addition, in the spectra of Compounds 
ni and rv peaks corresponding to their sugars are found. 

Hie H-NMR results match well with the structures given in the Figures. 

15 

Deposited microorgaDisms 

Hie following microorganism was deposited in Deutsche Sammlung von Mikroor- 
ganismen und Zellkulturen (DSM), Mascheroder Weg 1 b, D-38124 Braunschweig, 
20 Germany 

Microorganism Deposition number DepositioD date 

Streptomyces lividans 
25 TK24/pSY15 DSM 9436 IS September 1994 
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Scheme I Stnictural fonnulas of the compounds obtained 
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SEQUENCE LISTING 



(1) GENERAL INFORHATION: 



(i) APPLICANT: 

(A) NAME I Galllaeua Oy 

(B) STREET: Elinantie 2 A 9 

(C) CITY: Turku 

(E) COUNTRY: Suomt 

(F) POSTAL CODE (ZIP): FIN-20510 

(11) TITLE OF INVENTION! Process for producing anthracyc lines 

and Intermediates thereof 

(ill) NUMBER OF SEQUENCES: 5 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy dislt 

(B) COMPUTER: IBM PC compatible 

(C> OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(ill) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

<B) STRAIN: Streptomyces nogalater ATCC 27451 

(ix) FEATURE: 

<A} NAME/KEY: CDS 

(B) LOCATION: 359.. 1651 

(D) OTHER INFORMATION: /note= -ORFl" 

( ix ) FEATURE : 

<A) NAME/KEY: CDS 

(B) LOCATION: 2937.. 3197 

(D) OTHER INFORMATION: /note- -ORF3" 

(ix) FEATURE: 

(A) NAME/KEY: mlac feature 

(B) LOCATION: 164B7.1651 

(D) OTHER INFORMATION: /note" "overlapping sequence in 
ORFl and ORF2*' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GAATTCGGCC GTACCCCGAC GGCCGATTCC TTACCCTTCC GGAGCGGCTT GCGGATCGCA 60 

GGAOGAAGTC CTCCCTCTCC CCCCATCGGG CGTCCGCTCT TTGTGACCGG TTCAC6AGTC 120 

GGGTTCCAGC GCTCCTCGAC TCAGGATCGA CCCCTTCCGC GGTA6CC6CC CCGCAGGAAC 180 
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CGCAAACCTT CCX^CGCCGGT CCC6CCGGGC TTCCCCCCAC CCGTCCATCC GTCATTGAGC 240 

TGATTTCGAG ACAGGACGCG CACTGTCACC ACGAGCCCTG TGCGGTTGAA GTCATCACCT 300 

GTCCGCGCAC AGGAACTTCA AGACGATCAA AGCCCCTAGT GAAGGGCATC TTCGACGA 358 

ATO AAG GAA TCC ATC AAC CGT CGC GTG GTC ATC ACC GGA ATA GGG ATC 406 
Mot Lyfi Glu Ser He Asn Arg Arg Val Val He Thr Gly He Gly He 
15 10 15 

GTC GCG CCC GAT GCC ACC GGG GTG AAA CCG TTC TGG GAT CTC CTG ACQ 454 
Val Ala Pro Asp Ala Thr Gly Val Lys Pro Phe Trp Asp Leu Leu Thr 
20 25 30 

GCC CGT CGC ACC GCG ACC CGG ACC ATC ACC GCC TTC GAT CCC TCT CCG 502 
Ala Gly Arg Thr Ala Thr Arg Thr He Thr Ala Phe Asp Pro Ser Pro 
35 40 45 

TTC CGT TCC CGC ATC GCC GCG GAA TGC GAT TTC GAC CCG CTT GCC GAA 550 
Phe Arg Ser Arg He Ala Ala Glu Cyo Asp Phe Asp Pro Leu Ala Glu 
50 55 60 

GCG CTG ACC CCC CAG CAG ATC CGC CGT ATG GAC CGG GCC ACG CAG TTC 598 
Gly Leu Thr Pro Gin Gin He Arg Arg Met Asp Arg Ala Thr Gin Phe 
65 70 75 80 

GCG GTC GTC AGC GCC CGG GAA AGC CTG CAG GAC AGC GGA CTC GAC CTC 646 
Ala Val Val Ser Ala Arg Glu Ser Leu Glu Aep Ser Gly Leu Asp Leu 
85 90 95 

GGC GCC CTG GAC GCC TCC CGC ACC GGC GTG GTC GTC GGC AGC GCG GTC 694 
Gly Ala Leu Asp Ala Ser Arg Thr Gly Val Val Val Gly Ser Ala Val 
100 105 110 

GGC TGC ACC ACG AGC CTG GAA GAG GAG TAC GCG CTC GTC AGC GAC AGC 742 
Gly Cys Thr Thr Ser Leu Glu Glu Glu Tyr Ala Val Val Ser Asp Ser 
115 120 125 

GGC CGG AAC TGG CTG GTC GAC GAC GGC TAC GCC GTA CCG CAC CTA TTC 790 
Gly Arg Asn Trp Leu Val Asp Asp Gly Tyr Ala Val Pro His Leu Phe 
130 135 140 

GAC TAC TTC GTG CCC AGC TCC ATC GCC GCC GAG GTG GCA CAC GAC CGC 838 
Asp Tyr Phe Val Pro Ser Ser He Ala Ala Glu Val Ala Hie Asp Arg 
1^5 150 155 160 

ATC GGC GCG GAG GGC CCC GTC AGC CTC GTG TCG ACC GGG TGC ACC TCG 886 
Ho Gly Ala Glu Gly Pro Val Ser Leu Val Ser Thr Gly Cys Thr Ser 
165 170 175 

GGC CTG GAC GCC CTC CGC CGC COG GCC GAC CTG ATC GCC GAG GGA COG 934 
Gly Leu Asp Ala Val Gly Arg Ala Ala Asp Leu He Ala Glu Gly Ala 
180 185 190 

GCG GAT GTG ATG CTG GCC GGT GCG ACC GAG GCG CCC ATC TCC CCC ATC 982 
Ala Asp Val Met Leu Ala Gly Ala Thr Glu Ala Pro He Ser Pro He 
195 200 205 

ACC GTG GCG TGC TTC GAT GCC ATC AAG GCG ACC ACC CCC CGC AAC GAC 1030 
Thr Val Ala Cys Phe Asp Ala He Lys Ala Thr Thr Pro Arg Asn Asp 
210 215 220 

ACG CCC GCC GAG GCG TCC CGT CCG TTC GAC CGC ACC AGG AAC GGG TTC 1078 
Thr Pro Ala Glu Ala Ser Arg Pro Phe Asp Arg Thr Arg Asn Gly Phe 
225 230 235 240 
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GTA CTC GGC GAG GGC GCT GCC CTC TTC GTC CTC GAG GAG TTC GAA CAC 1126 
Val Leu Gly Glu Gly Ala Ala Val Phe Val Leu Glu Glu Phe Glu His 
245 250 255 

GCC CGC CGC CGG GGC GCG CTC GTG TAC GCG GAG ATC GCC GGG TTC GCC 1174 
Ala Arg Arg Arg Gly Ala Lou Val Tyr Ala Glu He Ala Gly Phe Ala 
260 265 270 

ACT CGC TGC AAC GCC TTC CAC ATG ACC GGT CTO CGC CCG GAC GGG CGG 1222 
Thr Arg Cys Asn Ala Phe His Met Thr Gly Leu Arg Pro Asp Gly Arg 
275 280 285 

GAG ATG GCG GAG GCC ATC GGG GTG GCG CTC GCC CAG GCG GGC AAG GCG 1270 
Glu Met Ala Glu Ala He Gly Val Ala Leu Ala Gin Ala Gly Lys Ala 
290 295 300 

CCG GCT GAC GTC GAC TAC GTC AAC GCC CAC GGT TCC GGC ACC CGG CAG 1318 
Pro Ala Asp Val Asp Tyr Val Asn Ala His Gly Ser Gly Thr Arg Gin 
305 310 315 320 

AAT GAC CGT CAC GAG ACG GCG GCC TTC AAG CGC AGT CTC GGC GAC CAC 1366 
Asn Asp Arg His Glu Thr Ala Ala Phe Lys Arg Ser Leu Gly Asp His 
325 330 335 

GCC TAC CGG GTC CCG GTC AGC AGC ATC AAA TCC ATG ATC GGG CAC TCG 1414 
Ala Tyr Arg Val Pro Val Ser Ser He Lye Ser Met He Gly His Ser 
340 345 350 

CTG GGC GCG ATC GGC TCC CTG GAG ATC GCC GCC TCC GTG CTG GCC ATC 1462 
Leu Gly Ala He Gly Ser Leu Glu He Ala Ala Ser Val Leu Ala He 
355 360 365 

ACA CAC GAC GTG GTG CCG CCC ACC GCC AAT CTG CAC GAG CCG GAT CCC 1510 
Thr His Asp Val Val Pro Pro Thr Ala Asn Leu His Glu Pro Asp Pro 
370 375 380 

CAG TGC GAT CTG GAC TAC GTG CCG CTG CGG GCG CGT GCG TGC CCG GTG 1558 
Glu Cys Asp Leu Asp Tyr Val Pro Leu Arg Ala Arg Ala Cys Pro Val 
385 390 395 400 

GAC ACG GTG CTC ACG GTG GGC AGC GGG TTC GGC GGT TTC CAG AGC GCC 1606 
Asp Thr Val Leu Thr Val Gly Ser Gly Phe Gly Gly Phe Gin Ser Ala 
405 410 415 



ATG GTG CTG TGC GGT CCG GGC TOG CGG CGA AGC TCG GCC GCG TGACGGCCGC 1656 
Met Val Leu Cys Gly Pro Gly Ser Arg Gly Arg Ser Ala Ala 
420 425 430 



CGTGGTGGTG 


ACCGGTCTC6 


GCGTCGTCGC 


CCCCACCGGT 


CTCGGGGTGC 


GGGAGCACT6 


1718 


GTCGAGTACG 


GTCC6GGCGG 


CGTOGGOGAT 


CGGACC6GTC 


ACCC6GTTCG 


ACCCCGGCCG 


1778 


GTACCCCAGC 


AAACTGGCCG 


GAGAGGTGCC 


CGGTTTCGTC 


CCGGA6GACC 


ATCTGCCCAG 


1838 


CCGGCTGATG 


COGCAGACGG 


ACCATATGAC 


GCGCCTGGCG 


CTCGTCGCGG 


CGGACT66GC 


1898 


CTTCCAGGAC 


GCCGCCGTGG 


ACCCGTCGAA 


GCTGCCGGAG 


TACGGCGTCG 


GCGTGGTCAC 


1958 


CGCGAGTTCG 


GOGGGGGGGT 


TCGAATTCGG 


CCACCGCGAG 


CTGCAGAACC 


TGTGGAGCCT 


2018 


GGGCCCGCAG 


TACGTCAGCG 


CGTATCAGTC 


GTTCGCATGG 


TTCTATGCCG 


TGAACACCGG 


2078 


TCAGGTGTCC 


ATCCGGCACG 


GGCTGCGCGG 


CCCGGGCGGG 


GTGCTGGTGA 


CGGAACAGGC 


2138 


GGGCGGCCTG 


GACGCCCTTG 


GGCAGGCCCG 


GCGGCAGTTG 


CGGCGCGGAC 


TGCCGATGGT 


2196 


GGTCGCGGGA 


GCCGTTGACG 


GCTCGCCCTG 


CCCCTGGGGC 


TGGCTGGCGC 


AGCTCAGCTC 


2258 
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GGGOGGCCTC ACCACGTCGG ACGACCCGCG CCGGGCCTAT CTCCCGTTCG ACGCCGCAGC 2318 

CGGCGGACAC 6TGCCGGCAG AGGGOGGCGC CCTGCTCGTC CTGGAGACCXJ ACGAGTCGGC 2378 

CCGGGCGCGC GGGGTGACGC GGT6GTACGG GCGCATCGAT GGGTACCCCG CCACATTOGA 2438 

CCCCCCGCCC GGTTCGGGGC GCCCGCCGAA CCTGCTGCGG GCCGCGCAGG CGGCACTG6A 2498 

CGACGCGGAG GTCGGACCCG AGGCGGTCGA CGTGGTGTTC 6CGGACGCGT CCGGCACCCC 2558 

GGACGAGGAC GCG6CGGAGG CCGACGCGGT CCGGC6CCTG TTCGGACCGT ACGGCGTTCC 2618 

G6TGACGGCG CCGAAGACCA TGACCG6CCG CCTCAGCGOG GGCGGC6CGG CCCTCX5ACGT 2678 

G6CGACG6CG CTGCTGGCGC TGCGCGAGGG CGTCCTCCCG CC6ACGGTCA ACGTCTCCCG 2738 

GCOGCGGCCG GAGTACGAGC TGGACCTGGT GCTCGCCCCC CGGCGCACGC CCCTGOCCAO 2798 

GGCCCTGGTG CTCGCGCGGG GCCGGGGCGG GTTCAATGCG GCGATGGTCG TGGCGGGGCC 2858 

GCGC6CGGAG ACACGGTGAA GCGGCCCGGC GCAGCCGGAG CCGCGGTAAG AGGCCACGGA 2918 

AGAGAGAGGG ATGOGACG GTG AAG CAG CAG CTG ACG ACG GAA CGG CTC ATG 2969 
Val Lys Gin Gin Leu Thr Thr Glu Arg Leu Met 
is 10 

GAG ATC ATG CGG GAG TGC GCG GGC TAG GGT GAG GAC GTC GAC GCT CTG 3017 
Glu lie Met Arg Glu Cye Ala Gly Tyr Gly Glu Asp Val Asp Ala Leu 
15 20 25 

GGC GAC ACG GAC GGC GCC GAC TTC GCC GCA CTC GGC TAC GAC TCG CTG 3065 
Gly Asp Thr Asp Gly Ala Asp Phe Ala Ala Leu Gly Tyr Aep Ser Leu 
30 35 40 

GCG CTC CTG GAA ACG GCC GGC CGG CTC GAG CGC GAG TTC GGC ATC CAG 3113 
Ala Leu Leu Glu Thr Ala Gly Arg Leu Glu Arg Glu Phe Gly lie Gin 
45 50 55 

CTC GGT GAC GAG GTG GTC GCC GAC GCC AGG ACG CCT GCC GAG CTG ACC 3161 
Leu Gly Asp Glu Val Val Ala Asp Ala Arg Thr Pro Ala Glu Leu Thr 
60 65 70 75 

GCC CTG GTC AAC CGG ACC GTC GCC GAG GOG GCC TGACCCGGCC GGCCCAC6AG 3214 
Ala Leu Val Asn Arg Thr Val Ala Glu Ala Ala 
80 85 

AGCGGGGTGA CGCGTGTGTA CGGCACGGAA CTCACACA 3252 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 430 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Glu Ser He Asn Arg Arg Val Val He Thr Gly He Gly He 
15 10 15 

Val Ala Pro Asp Ala Thr Gly Val Lys Pro Phe Trp Asp Leu Leu Thr 
20 25 30 

Ala Gly Arg Thr Ala Thr Arg Thr He Thr Ala Phe Asp Pro Ser Pro 
35 40 45 
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Phe Arg Ser Arg He Ala Ala Glu Cys Aep Phe Asp Pro Leu Ala Glu 
50 55 60 

Gly Leu Thr Pro Gin Gin He Arg Arg Met Asp Arg Ala Thr Gin Phe 
65 70 75 BO 

Ala Val Val Ser Ala Arg Glu Ser Leu Glu Asp Ser Gly Leu Asp Leu 
85 90 95 

Gly Ala Leu Asp Ala Ser Arg Thr Gly Val Val Val Gly Ser Ala Val 
100 105 110 

Glv Cys Thr Thr Ser Leu Glu Glu Glu Tyr Ala Val Val Ser Asp Ser 
115 120 125 

Gly Arg Asn Trp Leu Val Asp Asp Gly Tyr Ala Val Pro His Leu Phe 
130 135 140 

Asp Tyr Phe Val Pro Ser Ser He Ala Ala Glu Val Ala His Asp Arg 
145 150 155 160 

He Gly Ala Glu Gly Pro Val Ser Leu Val Ser Thr Gly Cys Thr Ser 
165 170 175 

Gly Leu Asp Ala Val Gly Arg Ala Ala Asp Leu He Ala Glu Gly Ala 
180 185 190 

Ala Asp Val Met Leu Ala Gly Ala Thr Glu Ala Pro He Ser Pro He 
195 200 205 

Thr Val Ala Cys Phe Asp Ala He Lys Ala Thr Thr Pro Arg Asn Asp 
210 215 220 

Thr Pro Ala Glu Ala Ser Arg Pro Phe Asp Arg Thr Arg Asn Gly Phe 
225 230 235 240 

Val Leu Gly Glu Gly Ala Ala Val Phe Val Leu Glu Glu Phe Glu Hie 
245 250 255 

Ala Arg Arg Arg Gly Ala Leu Val Tyr Ala Glu He Ala Gly Phe Ala 
260 265 270 

Thr Arg Cys Asn Ala Phe His Met Thr Gly Leu Arg Pro Asp Gly Arg 
275 280 285 

Glu Met Ala Glu Ala He Gly Val Ala Leu Ala Gin Ala Gly Lys Ala 
290 295 300 

Pro Ala Asp Val Asp Tyr Val Asn Ala His Gly Ser Gly Thr Arg Gin 
305 3X0 315 320 

Asn Asp Arg His Glu Thr Ala Ala Phe Lys Arg Ser Leu Gly Asp His 
325 330 335 

Ala Tyr Arg Val Pro Val Ser Ser He Lys Ser Met He Gly His Ser 
340 345 350 

Leu Gly Ala He Gly Ser Leu Glu He Ala Ala Ser Val Leu Ala He 
355 360 365 

Thr His Asp Val Val Pro Pro Thr Ala Asn Leu His Glu Pro Asp Pro 
370 375 380 

Glu Cys Asp Leu Asp Tyr Val Pro Leu Arg Ala Arg Ala Cys Pro Val 
385 390 395 400 

Asp Thr Val Leu Thr Val Gly Ser Gly Phe Gly Gly Phe Gin Ser Ala 
405 410 415 
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Met Val Leu Cys Gly Pro Gly Ser Arg Gly Arg Ser Ala Ala 
420 425 430 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE] amino acid 
(D) TOPOLOGY 1 linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Val Lye Gin Gin Leu Thr Thr Glu Arg Leu Met Glu He Met Arg Glu 
15 10 15 

Cya Ala Gly Tyr Gly Glu Asp Val Asp Ala Leu Gly Asp Thr Asp Gly 
20 25 30 

Ala Asp Phe Ala Ala Leu Gly Tyr Asp Ser Leu Ala Leu Leu Glu Thr 
35 40 45 

Ala Gly Arg Leu Glu Arg Glu Phe Gly He Gin Leu Gly Asp Glu Val 
50 55 60 

Val Ala Asp Ala Arg Thr Pro Ala Glu Leu Thr Ala Leu Val Asn Arg 
65 70 75 80 

Thr Val Ala Glu Ala Ala 
85 



(2) INFORMATION FOR SEQ ID NOx 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH! 3252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Stroptomycea nogalater ATCC 27451 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1648.. 2877 

(D) OTHER INFORMATION: /note- '•0RF2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GAATTCGGCC GTACCCCGAC GGCCGATTCC TTACCCTTCC G6AGCG6CTT GCGGATCGCA 60 

G6ACGAAGTC CTCCCTCTCC CCCCATC6GG CGTCCGCTCT TTGTGACCGG TTCAOGAGTC 120 

GGGTTCCAGC GCTCCTCGAC TCAGGATCGA CCCCTTCCGC GGTAGCCGCC CCGCAGGAAC 180 

CGCAAACCTT CCGCGCCGGT CCCGCCGGGC TTCGCCGCAC CCGTCCATCC GTCATTGAGC 240 
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TGATTTCGAG 


ACAGGACGCG 


CACTGTCACC 


ACGAGCCCTG 


TGCGGTTGAA 


GTCATCACCT 


300 


GTCCGCGCAC 


AGGAACTTCA 


AGACGATCAA 


AGCCCCTAGT 


GAAGGGCATC 


TTC6ACGAAT 


360 


GAAGGAATCC 


ATCAACCGTC 


GCGTGGTCAT 


CACCGGAATA 


GGGATCGTCG 


CGCCCGATGC 


420 


CACCGGGGTG 


AAACOGTTCT 


GGGATCTGCT 


GACGGCCGGT 


OGCACCGCGA 


CCCGGACCAT 


480 


CACCGCCTTC 


CATCCXTCTC 


CGTTCCGTTC 


CCGCATCGCC 


GCGGAATGCG 


ATTTCGACCC 


540 


GCTTGCCGAA 


GGGCTGACCC 


CCCAGCAGAT 


CCGGCGTATG 


GACCGGGCCA 


CGCAGTTCGC 


600 


GGTCGTCAGC 


GCCCGGGAAA 


GCCTGGAGGA 


CAGCGGACTC 


GACCTCX^GOG 


CCCTGGACGC 


660 


CTCCCGCACC 


GGCGTGGTCG 


TCGGCAGCGC 


GGTCGGCTGC 


ACCACGAGCC 


TGGAAGAGGA 


720 


GTACGCGGTC 


GTCAGCGACA 


GCGGCCGGAA 


CTGGCTGGTC 


GACGACGGCT 


ACGCCGTACC 


780 


GCACCTATTC 


GACTACTTCG 


TGCCCAGCTC 


CATCGCCGCC 


GAGGTGGCAC 


ACGACCGCAT 


840 


CGGCGCGGAG 


GGCCCOGTCA 


6CCTCGTGTC 


GACCGGGTGC 


ACCTCGGGCC 


TGGACGCCGT 


900 


GGGCCGCGCG 


GCC6ACCTGA 


TCGCCGAG6G 


AGCGGCGGAT 


GTGATGCTGG 


CCGGTGCGAC 


960 


CGAGGCGCCC 


ATCTCCCCCA 


TCACCGTGGC 


GTGCTTCGAT 


GCCATCAAGG 


CGACCACCCC 


1020 


CCGCAACGAC 


ACGCCCGCCG 


AGGC6TCCCG 


TCCGTTCGAC 


CGCACCAGGA 


ACGGGTTCGT 


1080 


ACTCGGCGAG 


GGCGCTGCCG 


TGTTCGTCCT 


GGAGGAGTTC 


GAACACGCGC 


GCCGCCGGGG 


1140 


CGCGCTCGTG 


TACGCGGAGA 


TCGCCGGGTT 


CGCCACTCGC 


TGCAACGCCT 


TCCACATGAC 


1200 


CGGTCTGCGC 


CCGGACGGGC 


GGGAGATGGC 


GGAGGCCATC 


GGGGTGGCGC 


TCGCCCAGGC 


1260 


GGGCAAGGCG 


CCGGCTGACG 


TCGACTACGT 


CAACGCCCAC 


GGTTCCGGCA 


CCCGGCAGAA 


1320 


TGACCGTCAC 


GAGACGGCGG 


CCTTCAAGOG 


CAGTCTOGGC 


GACCACGCCT 


ACCGGGTCCC 




GGTCAGCAGC 


ATCAAATCCA 


TGATOGGGCA 


CTCGCTGGGC 


GCGATCGGCT 


CCCTGGA6AT 


1440 


CGCOSCCTCC 


GTGCTGGCCA 


TCACACACGA 


CGTGGTGCCG 


CCCACCGCCA 


ATCTGCACGA 


1500 


GCCGGATCCC 


GAGTGCGATC 


TGGACTACGT 


GCCGCTGCGG 


GCGCGTGCGT 


GCCCGGTGGA 


1560 


CACGGTGCTC 


AC6GTGGGCA 


GCGGGTTC6G 


CGGTTTCCAG 


AGCGCCATGG 


TGCTGTGCGG 


1620 


TCCGGGCTC6 


CGGGGAAGGT 


CGGCCGC GTG ACG GCC GCC GTG GTG GTG ACC 


1671 



Val Thr Ala Ala Val Val Val Thr 

1 5 



GGT CTC GGC GTC GTC GCC CCC ACC GGT CTC GGG GTG CGC GAG CAC TGG 1719 
Gly Leu Gly Val Val Ala Pro Thr Gly Leu Gly Val Arg Glu Hia Trp 
10 15 20 

TCG AGT ACG GTC CGG GGG GCG TCG GCG ATC GGA CCG GTC ACC CGG TTC 1767 
Ser Ser Thr Val Arg Gly Ala Ser Ala lie Gly Pro Val Thr Arg Phe 
25 30 35 40 

GAC GCC GGC CGG TAC CCC AGC AAA CTG GCC GGA GAG GTG CCC GGT TTC 1815 
Aep Ala Gly Arg Tyr Pro Ser Lys Leu Ala Gly Glu Val Pro Gly Phe 
45 50 55 

GTC CCG GAG GAC CAT CTG CCC AGC CGG CTG ATG CCG CAG ACG GAC CAT 1863 
Val Pro Glu Asp His Leu Pro Ser Arg Leu Met Pro Gin Thr Aep His 
60 65 70 
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ATG ACG CGC CTG GCC CTC GTC GCG GCG 6AC TGG CCC TTC CAG GAC GCC 1911 

Met Thr Arg Leu Ala Leu Val Ala Ala Asp Trp Ala Phe Gin Aap Ala 
75 80 85 

GOC GTG GAC CCG TCG AAG CTG CCG GAG TAG GGC GTC GCC GTG GTC ACC 1959 
Ala Val Asp Pro Ser Lye Leu Pro Glu Tyr Gly Val Gly Val Val Thr 
90 9S 100 

GCG ACT TCG OCO CCC GGG TTC CAA TTC GCC CAC CCC GAG CTG CAG AAC 2007 
Ala Ser ser Ala Gly Gly Phe Glu Phe Gly His Arg Glu Leu Gin Asn 
105 110 115 120 

CTG TGG AGC CTG GGC CCG CAG TAC GTC AGC GCG TAT CAG TCG TTC CCA 2055 
Leu Trp Ser Leu Gly Pro Gin Tyr Val Ser Ala Tyr Gin Ser Phe Ala 
125 130 135 

TGC TTC TAT GCC GTG AAC ACC GGT CAG GTG TCC ATC CCG CAC GGC CTC 2103 
Trp Phe Tyr Ala Val Asn Thr Gly Gin Val Ser He Arg Hie Gly Leu 
140 145 150 

CGC GGC CCC CCC CCG GTG CTG GTG ACC CAA CAG GOG GGC GGC CTG GAC 2151 
Arg Gly Pro Gly Gly Val Leu Val Thr Glu Gin Ala Gly Gly Leu Asp 
155 160 165 

GCC CTT GGG CAG CCC CCC CGC CAG TTG CGC CGC CCA CTG CCG ATG CTG 2199 
Ala Leu Gly Gin Ala Arg Arg Gin Leu Arg Arg Gly Leu Pro Met Val 
170 175 180 

CTC GCG GGA GCC CTT CAC GGC TCG CCC TGC CCC TGG CCC TGG GTC GCG 2247 
Val Ala Gly Ala Val Asp Gly Ser Pro Cys Pro Trp Gly Trp Val Ala 
185 190 195 200 

CAG CTC AGC TCG GGC GGC CTC AGC ACG TCG GAC GAC COG CGC CGG GCC 2295 
Gin Leu Ser Ser Gly Gly Leu Ser Thr Ser Asp Asp Pro Arg Arg Ala 
205 210 215 

TAT CTG CCG TTC GAC GCC GCA GCC GCC CCA CAC CTC CCG GGA GAG COO 2343 
Tyr Leu Pro Phe Asp Ala Ala Ala Gly Gly His Val Pro Gly Glu Gly 
220 225 230 

GGC GCC CTG CTC GTC CTG GAG AGC GAC GAG TOO CCC CCC CCC CCC GGG 2391 
Gly Ala Leu Leu Val Leu Glu Ser Asp Glu Ser Ala Arg Ala Arg Gly 
235 240 245 

CTG ACG CGG TGG TAC GGG CGC ATC OAT GCG TAC GCC GCC ACA TTC GAC 2439 
Val Thr Arg Trp Tyr Gly Arg He Asp Gly Tyr Ala Ala Thr Phe Asp 
250 255 260 

CCC CCG CCC GGT TCG GGG CGC CCG CCC AAC CTC CTG CGG GCC GCG CAG 2487 
Pro Pro Pro Gly Ser Gly Arg Pro Pro Asn Leu Leu Arg Ala Ala Gin 
265 270 275 280 

GCG GCA CTC CAC CAC CCG GAG GTC GGA CCC GAC GOC GTC CAC GTC GTG 2535 
Ala Ala Leu Asp Asp Ala Glu Val Gly Pro Glu Ala Val Asp Val Val 
285 290 295 

TTC GCG GAC GCG TCC CGC ACC CCG GAC GAG GAC GCG GCG GAG GCC CAC 2583 
Phe Ala Asp Ala Ser Gly Thr Pro Asp Glu Asp Ala Ala Glu Ala Asp 
300 305 310 

GCG GTG CGG CGC CTG TTC GGA CCG TAC GGC GTT CCG GTG ACG CCG CCC 2631 
Ala Val Arg Arg Leu Phe Gly Pro Tyr Gly Val Pro Val Thr Ala Pro 
315 320 325 

AAC ACC ATG ACC GGC CGC CTC AGC GCG CGC CGC GCG GCC CTC GAC GTC 2679 
Lys Thr Met Thr Gly Arg Leu Ser Ala Gly Gly Ala Ala Leu Asp Val 
330 335 340 
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GCG ACG GCG CTC CTG GCG CTG CGC GAG CGC GTC GTC CCG CCC ACG CTC 2727 
Ala Thr Ala Leu Leu Ala Leu Arg Glu Gly Val Val Pro Pro Thr Val 
345 350 355 360 

AAC CTC TCC CGG CCG CGG CCG GAG TAG GAG CTG GAC CTG GTC CTC 6CC 2775 
Asn Val ser Arg Pro Arg Pro Glu Tyr Glu Leu Aep Leu Val Leu Ala 
365 370 375 

CCC CGG CGC ACO CCC CTG GCC AGG GCC CTG GTC CTC GCG CGG GGC CGG 2823 
Pro Arg Arg Thr Pro Leu Ala Arg Ala Leu Val Leu Ala Arg Gly Arg 
380 385 390 

GGC GGC TTC AAT CCC GCG ATC GTC CTG GCG GCG CCG CCC CCC GAG ACA 2871 
Gly Gly Phe Asn Ala Ala Met Val Val Ala Gly Pro Arg Ala Glu Thr 
395 400 405 

CGC TGAAGCGGCC CGGCGCAGCC CCAGCCGCGG TAAGAGGCCA CCCAACACAC 2924 
Arg 

410 

AGGGATGCCA CCCTGAAGCA GCAGCTGACG ACCCAACCGC TCATGGAGAT CATGCCCGAG 2984 

TGCGCGGGCT ACGCTGAGGA CGTC6ACGCT CTGGGCGACA CGGACGGCGC CGACTTCCCC 3044 

GCACTCGGCT ACGACTCGCT CCCGCTCCTC GAAACGGCCG GCCGGCTCGA GCGCCACTTC 3104 

CGCATCCACC TCGCTGACGA GGTGGTCGCC GACGCCAGGA CGCCTGCCGA GCTGACCCCC 3164 

CTCGTCAACC GCACCGTGGC CGAGCCGGCC TGACCCCCCC GCCCCACGAC AGCGGGCTGA 3224 

CGCGTCTGTA CGGCACGGAA CTCACACA 3252 

(2) INFORMATION FOR SEQ ID NOi 5: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 409 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Val Thr Ala Ala Val Val Val Thr Gly Leu Gly Val Val Ala Pro Thr 
15 10 15 

Gly Leu Gly Val Arg Glu Hie Trp Ser Ser Thr Val Arg Gly Ala Ser 
20 25 30 

Ala He Gly Pro Val Thr Arg Phe Asp Ala Gly Arg Tyr Pro Ser Lys 
35 40 45 

Leu Ala Gly Glu Val Pro Gly Phe Val Pro Glu Aap Hia Leu Pro Ser 
50 55 60 

Arg Leu Met Pro Gin Thr Asp Hie Met Thr Arg Leu Ala Leu Val Ala 
65 70 75 80 

Ala Asp Trp Ala Phe Gin Asp Ala Ala Val Asp Pro Ser Lys Leu Pro 
85 90 95 

Glu Tvr Gly Val Gly Val Val Thr Ala Ser Ser Ala Gly Gly Phe Glu 
' 100 105 110 

Phe Gly His Arg Glu Leu Gin Asn Leu Trp Ser Leu Gly Pro Gin Tyr 
115 120 125 
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Val Ser Ala Tyr Gin Ser Phe Ala Trp Phe Tyr Ala Vai Asn Thr Gly 
130 135 140 

Oln Val Ser He Arg His Gly Leu Arg Gly Pro Gly Gly Val Leu Val 
^45 150 155 160 

Thr Glu Gin Ala Gly Gly Leu Asp Ala Leu Gly Gin Ala Arg Arg Gin 
165 170 175 

Leu Arg Arg Gly Leu Pro Met Val Val Ala Gly Ala Val Asp Gly Ser 
180 185 190 

Pro Cys Pro Trp Gly Trp Val Ala Gin Leu Ser Ser Gly Gly Leu Ser 
195 200 205 

Thr Ser Asp Asp Pro Arg Arg Ala Tyr Leu Pro Phe Asp Ala Ala Ala 
210 215 220 

Gly Gly His Val Pro Gly Glu Gly Gly Ala Leu Leu Val Leu Glu Ser 
225 230 235 240 

Asp Glu Ser Ala Arg Ala Arg Gly Val Thr Arg Trp Tyr Gly Arg He 
245 250 255 

Asp Gly Tyr Ala Ala Thr Phe Asp Pro Pro Pro Gly Ser Gly Arg Pro 
260 265 270 

Pro Asn Leu Leu Arg Ala Ala Gin Ala Ala Leu Asp Asp Ala Glu Val 
275 280 285 

Gly Pro Glu Ala Val Asp Val Val Phe Ala Asp Ala Ser Gly Thr Pro 
290 295 300 

Asp Glu Asp Ala Ala Glu Ala Asp Ala Val Arg Arg Leu Phe Gly Pro 
305 310 315 320 

Tyr Gly Val Pro Val Thr Ala Pro Lys Thr Met Thr Gly Arg Leu Ser 
325 330 335 

Ala Gly Gly Ala Ala Leu Asp Val Ala Thr Ala Leu Leu Ala Leu Arg 
340 345 350 

Glu Gly Val Val Pro Pro Thr Val Asn Val Ser Arg Pro Aro Pro Glu 
355 360 365 

Tyr Glu Leu Asp Leu Val Leu Ala Pro Arg Arg Thr Pro Leu Ala Arg 
370 375 380 

Ala Leu Val Leu Ala Arg Gly Arg Gly Gly Phe Asn Ala Ala Met Val 
385 390 395 400 

Val Ala Gly Pro Arg Ala Glu Thr Arg 
405 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCr Rule \3bis) 



A, Iht mdications made below relalc lo the miooorganisra tefmed to in tbe descripiion 
26 .line 25 



on page 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are idcntifled on an additional ibeet Q 



Name of depositary institution 

Deutsche Saminlung von Mikroorganismen and Zcllkulturen (DSM) 



Address of deposiUry institution (inchdiitg postal code and country) 



Mascheiodcr Weg 1 b, D-38124 Braunschweig, Germany 



Date of deposit 

15 September 1994 


Accession Number 

DSM 9436 


C. ADDITIONAL INDICATIONS (imvebhmkifnoiappfieaUe) This infonnation is ooniinued on an additional sheet ^ 



In respect of those designations in which a European patent or a patent in Finland or Norway is 
sought, a sample of the deposited microorganism will be made available until the publication of the 
mention of the grant of the European patent or the corresponding infomaation concerning the patent in 
Finland or Norway or until the date on which the application has been refused or withdrawn or is 
deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person re- 
questing the sample (Rule 28(4) EPC and the corresponding regulations in Finland and Norway). 



D. DESIGNATED STATES FOR WmCH INDICATIONS ARE MADE Of the indleatioiusn net f^raOdesiptaud Slates) 



E. SEPARATE FURNISHINC OF INDICATIONS (/«n« btank If tun opplieahk) 



The indications | isted below will be submitted to the Interna liona I Bureau \zicT(spteify the general nettsre of theindicatwme,g^ 'Aemom 
Hitmber of Deposit') 



For receiving OfTice use only 



12 was received with tbe international application 



Authorized oCCcer 



For Interna liooal Bureau use only 



n "Hii* sheet was received by the International Bureau on: 



Authorized oCGcer 



Form PCr/ROA34 (July 1992) 
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Indications relating to deposited microorganisms 
Continuation to C. ADDITIONAL INDICATIONS 

DSM 9436 

When designating Australia, in accordance with regulation 3.25 of the Patents 
Regulations (Australia Statutory Rules 1991 No. 71), samples of materials deposited 
in accordance with the Budapest Treaty in relation to this Patent Request are only to 
be provided before: the patent is granted on the application; or the application has 
lapsed or been withdrawn or refused; to a person who is: a skilled addressee without 
an interest in the invention; and nominated by a person who makes a request for the 
furnishing of those samples. 
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Claims 

1. Isolated and purified DNA-fragment, which is a gene fragment of the anthracyclinc 
biosynthetic pathway of the bacterium Streptomyces nogalater being included in an 

5 ac//-hybridi2ing 12 kb Bg/II-fragment of 5. nogalater genome. 

2. DNA-fragment according to claim I, which comprises the nucleotide sequence 
given in SEQ ID N0;1 or a functional part thereof. 

10 3. Recombinant-DNA-construction» which comprises the DNA-fragment according to 
claim 1 or 2, included in a plasmid which can be transferred into a Streptomyces- 
bacterium and is copied therein. 

4. Recombinant-DNA-construction according to claim 3 which is the plasmid pSY15, 
15 the structure of which is given in Fig. 3, and which was deposited in S. lividans strain 

TK24/pSY15 with the deposition number DSM 9436. 

5. Process for the production of anthracydines and precursors thereof, comprising 
transferring the DNA-fragment according to claim 1 or 2 into a foreign Streptomyces 

20 host, cultivating the recombinant strain obtained, and isolating the products formed. 

6. Process according to claim 5, wherein the Streptomyces host is S. lividans or S. 
galilaeus, 

25 7. Process according to claim 5 for producing auramycinone or glycosides thereof, 
comprising transferring the DNA-fragment according to claim 2 into Streptomyces 
galilaeus host or a mutant thereof, cultivating the recombinant strain so obtained and 
isolating auramycinone or a glycoside thereof as formed. 

30 8. Process according to claim 7, wherein the Streptomyces galilaeus host is the mutant 
strain H028 of 5. galilaeus ATCC 31615. 
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9. An anthracydinc precursor which is obtainable according to claim 5 and has the 
following formula I 
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A) (starting unit: propionate) 

Q COCH 




OH O OH 0 
2-OH-aklanone acid (H061) 

O ^ COOCH, 




Methyl ester of aklanone acid (H036) 





OH O OH OH 
Aklavinone (H039) 



OH O OH 6 

OrN(CH3)2 S. galilaeits 



\ 



Aclacinomycin 
Q OH 



O OH COOCH, 





S. peucetius 



8-rhodomycinone 

i 



Daunorubicin 



Fig. lA/1 
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8-rhodomycinone 



1 



O OH COOCH 




AhN(CH3)2 



O OH O 



N(CH3)2 
I OH 




OH O OH O 



OhN(CH3)2 



s. 



Betaclamycin (H038/EB3) Rhodomycin 



purpurascens 



Fig. lA/2 
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B) (starting unit: acetate) 




OCH3 ^ nogalater 
Nogalamycin 0CH3 



Fig. IB 
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= pIJ486 (6.2 kb) 

= pSY15 insert (12 kb) 



Fig. 3 
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